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MEMBRANE-ASSOCIATED IMMUNOGENS OF MYCOBACTERIA 



Technical Field of the Invention 



The invention relates to membrane-associated 
polypeptides of mycobacteria and, in particular, the 
5 use of such polypeptides and the nucleic acids encoding 
them for use as vaccines and diagnostic reagents. 

Background of the Invention 

The mycobacteria are a diverse collection of acid fast, 
gram-positive bacteria, some of which cause important 
10 human and animal diseases. In humans, the two most 
common mycobacteria-caused diseases are tuberculosis 
(TB) and leprosy, which result from infection with M. 
tuberculos'is and M. leprae f respectively. 

Tuberculosis displays all of the principal 
15 characteristics of a global epidemic disease. 
Currently, tuberculosis afflicts more than 35 million 
individuals worldwide and results in over 4 million 
deaths annually. In India, at any given time, almost 
8 million people are reported to suffer from this 
2 0 disease and 500,000 deaths recorded. These figures may 
not cover the totality of those suffering from this 
disease in this country. Thus, tuberculosis appears to 
be a problem of major concern in India as also in many 
other countries of the world. 
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Tuberculosis is caused by M. tuberculosis , M. bovis . M.. 
af ricanum and M. microti, the acid-fast, Gram positive, 
tubercle bacilli of the family Mycobacteriaceae. Some 
local pathogenic strains of M. tuberculosis have also 
5 been isolated from patients in Madras and other cities 
in India, which differ in some respects from M. 
tuberculosis H37Rv, which is a virulent strain. 

In recent years, certain groups of individuals with 
AIDS have been found to have a markedly increased 

10 incidence of TB as well. It has now been shown that 
one group of mycobacteria which consists of M. avium . 
M. intracellular and M. scrof ulaceum . jointly known as 
MAIS complex, is responsible for disseminated disease 
in a large number of persons with AIDS (Kiehn et al., 

15 J . Clin. Microbiol. , 21:168-173 (1985); Wong et al., 
Amer. J. Med. . 78:35-40 (1985)). 

Since Koch identified M. tuberculosis as the causative 
agent of tuberculosis in 1882, many scientific studies 
and public health efforts have been directed at 

2 0 diagnosis, treatment and control of this disease. 
However, characteristics of M. tuberculosis have 
hampered research to improve diagnosis and to develop 
more effective vaccines. In addition, the biochemical 
composition of the organism has made identification and 

25 purification of the cellular constituents difficult, 
and many of these materials once purified, lack 
sensitivity and specificity as diagnostic reagents. As 
a result, diagnostic and immunoprophy lactic measures 
for mycobacterial diseases have changed little in the 

30 past half century. The conventional methods for the 
diagnosis of M. tuberculosis are troublesome and 
results are delayed. 

Bacillus Calmette-Guerin (BCG) , an avirulent strain of 
M. bovis (Calmette, A., Masson et Cie . Paris (1936)), 
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is used extensively as a vaccine against tuberculosis. 
Though numerous studies have found that it has 
protective efficacy against tuberculosis (Luelmo, F. , 
Am. Rev. Respir. Pis. , 125, 70-72 (1982)) BCG has 
5 failed to protect against tuberculosis in several 
trials (WHO, Tech. Rep. Ser. . 651:1-15 (1980)) for 
reasons that are not entirely clear (Fine, P., 
Tubercle . 65:137-153 (1984); Fine, et al., Lancet . 
(ii):499-502 (1986)). 

10 The eradication with vaccination, early diagnosis, and 
efficient therapy is an important objective of the 
drive to combat mycobacterioses . The lacunae in the 
present knowledge of the biology of these pathogens - 
their make-up, their natural history, their physiology, 

15 biochemistry and immunological reactivities, highlights 
the need for attempts to unravel their weaknesses, so 
that more efficient ways to combat this disease can be 
devised. To develop more effective tools for the 
diagnosis and prevention of these diseases, it is 

2 0 important to understand the immune response to 
infection by mycobacterial pathogens. The 
mycobacterial components that are important in 
eliciting the cellular immune response are not yet well 
defined. The antibody and T-cell responses to 

2 5 infection or inoculation with killed mycobacteria have 

been studied in humans and in animals. Human patients 
with TB or leprosy produce serum antibodies directed 
against mycobacterial antigens. Although antibodies 
may have some function in the antimycobacterial immune 

3 0 response, the exact function remains to be clarified 

since no protective role can be ascribed to these 
antibodies. Protection against mycobacterial diseases 
involves cell-mediated immunity. 

Mycobacteria do not produce any directly toxic 
3 5 substances and consequently their pathogenicity results 



from multiple factors involved in their interaction 
with the infected host. Intracellular parasitism 
probably depends on host cell trophic factors; it is 
conceivable that their short supply may be 
5 bacteriostatic and could play a role in the mechanism 
of mycobacterial dormancy. 

It is generally understood that protective immunity in 
mycobacterial infection is mediated by specific T cells 
which activate macrophages into non-specific 

10 tuberculocidal activity. Evidence suggests that gamma- 
IFN triggers macrophages towards H 2 0 2 -mediated 
bacterial killing, but related or other macrophage 
activating factor (MAF) molecules may also be involved. 
The causes responsible for the inadequate bactericidal 

15 function at sites of abundant T cell proliferation have 
not yet been explained. Dissociation between delayed- 
type hypersensitivity (DTH) and protective immunity led 
to views that T-cells of a distinct subset or 
specificity could be responsible for the acquired 

2 0 resistance to mycobacterial infection. Alternatively, 
interference with protection may result from corollary 
cellular reactions, namely by suppressor T-cells and 
macrophages, or from the shifting of T-cells towards 
helper function for B-cells. 

2 5 Unlike viral and some parasite pathogens which can 

evade host resistance by antigenic shift, mycobacteria 
have a resilient cell wall structure and can suppress 
host immune responses by the action of their 
immunomodulatory cell wall constituents. Whilst the 

3 0 success of protective immunization towards other 

microbial pathogens mainly depends on quantitative 
parameters of immunity, it appears that mycobacterial 
immunomodulatory stimuli produce a regulatory 
dysfunction of the host immune system. This may not be 
35 possible to override simply by more resolute 



-5- 



immunization using vaccines of complex composition such 
as whole mycobacteria (e.g. BCG) . Perhaps mycobacteria 
did not evolve potent "adjuvant" structures to boost 
the host immunity but rather to subvert host defenses 
5 towards ineffective cellular reactions operating to the 
advantage of the pathogen. Vaccination with an 
attenuated pathogen such as BCG could amplify further 
immune responses but with limited protection of the 
host, the potential scope for immunization with defined 
10 antigens is yet to be explored. 

The purification and characterization of individual 
antigenic proteins are essential in understanding the 
fundamental mechanism of the DTH reaction on the 
molecular level. The possible functional role of 

15 proteins of defined structure in the pathogenesis of 
mycobacterial diseases as well as for diagnostic 
purposes remains of great interest. Numerous groups 
have attempted to define mycobacterial antigens by 
standard biochemical and immunological technigues, and 

2 0 common as well as species specific antigens have been 
reported in mycobacteria (Minden, et al., Infect. 
Immun . , 46:519-525 (1984); Closs, et al., Scand. J. 
Immunol . , 4.2:249-263.(1980); Chaparas, et al., Am. Rev. 
Respir. Pis. . 122:533 (1980); Daniel, _ et al., 

25 Microbiol. Rev. , 42:84-113 (1978); Stanford, et al., 
Tubercle . 55:143-152 (1974); Kuwabara, S., J. Biol. 
Chem. . 250:2556-2562 .(1975)). 

Very little information about the mycobacterial genome 
is available. Initially, basic studies were conducted 

30 to estimate the genome size, G+C content and the degree 
of DNA homology between the various mycobacterial 
genomes (Grosskinsky, et al., Infect. Immun. , 57, 
5:1535-1541 (1989); Garcia, et al . , J. Gen. 
Microbiol. . 132:2265-2269 (1986); Imaeda, T., Int. J. 

35 Sys. Bacteriol. . 35, 2:147-150 (1985); Clark-Curtiss , 



et al., J. Bacteriol. . 161 3:1093-1102 (1985); Baess, 
I. et al., B. , Acta. Path. Microbiol. Scand. . (1978) 
86:309-312; Bradley, S. G. , Am. Rev. Respir. Pis. . 
106:122-124 (1972)). Recently, recombinant DNA 

5 techniques have been used for the cloning and 
expression of mycobacterial genes. Genomic DNA 
fragments of M. tuberculosis , M. leprae and some other' 
mycobacterial species were used for the construction of 
lambda gtll phage (Young, et al., Proc . Natl . Acad . 

10 Sci. f U.S.A., 82:2583-2587 (1985); Young, et al., 
Nature (London), 316:450-452 (1985)) or other vector- 
based recombinant gene libraries. These libraries were 
screened with murine monoclonal antibodies (Engers, et 
al., Infect. Immun. , 48:603-605 (1985); Engers, et al., 

15 Infect. Immun. , 51:718-720 (1986)) as well as 
polyclonal antisera and some immunodominant antigens 
were identified. The principal antigen among these 
being five 12, 14, 19, 65 & 71 kDa of M. tuberculosis 
(Young et al., Proc. Natl. Acad. Sci. , U.S.A., 82:2583- 

20 2587 (1985); Shinnick et al . , Infect. Immun. . 
55 (7) : 1718-1721 (1987); Husson and Young, Proc. Natl. 
Sc. Acad. . 84:1679-1683 (1987); and five 12, 18, 23, 36 
& 65 kDa antigens of M. leprae (Young, et al. , Nature 
(London), 316:450-452 (1985)). A few homologues of 

25 some of these antigens were also identified in some 
other mycobacterial species (e.g., BCG) (Yamaguchi et 
al., FEB 06511 . 240:115-117 (1988); Yamaguchi et al . , 
Infect. Immun. . 57:283-288 (1989); Matsuo, et al., J. 
Bacteriol. . 170, 9:3847- 3854 (1988); Radford, et al., 

30 Infect. Immun. . 56, 4:921-925 (1988); Lu, et al.. 
Infect. Immun. . 55, 10:2378-2382 (1987); Minden, et 
al., Infect. Immun. . 53, 3:560-564 (1986); Harboe, et 
al.. Infect. Immun. . 52, 1:293-302 (1986); Thole, et 
al., Infect. Immun. . 50, 3:800-806 (1985)). These 

35 antigens, however, are either intracellular or secreted 
molecules. 



-7- 

Al though M. bovis BCG has been widely used as a vaccine 
against tuberculosis, the determination of the 
membrane-associated polypeptides of mycobacterium that 
are capable of inducing a protective immune response is 
5 highly desirable. The use of such a membrane- 
associated polypeptide or the DNA encoding it provides 
for the generation of recombinant vaccines, e.g. , 
mycobacterial membrane-associated immunogens expressed 
in, for example, a virus or bacterium such as vaccinia 
10 virus, Salmonella, etc. used as a live carrier, or the 
display of non-mycobacterial immunogens on the surface 
of a cultivable mycobacterial strain which can be used 
as a live recombinant vaccine. 

Accordingly, it is an object herein to provide methods 
15 for identifying and isolating nucleic acids encoding a 
membrane-associated polypeptide of mycobacteria. 

Further, it is an object herein to provide membrane- 
associated polypeptides of mycobacteria and the nucleic 
acids encoding it. 

20 Still further, it is an object herein to provide 
vaccines utilizing all or part of the membrane- 
associated polypeptide of a mycobacterium or the DNA 
encoding such membrane-associated polypeptide. 

Still further, it is an object to provide reagents 
25 comprising said membrane-associated polypeptide with a 
mycobacterium or DNA encoding it useful in diagnostic 
assays for mycobacterial infection. 

Still further, it is an object to provide a promoter 
sequence comprising the promoter of said membrane 
30 associated polypeptide, which can direct gene 
expression in mycobacteria as well as in other 
microorganisms such as E. coli . 
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Summarv of the Invention 

In accordance with the foregoing objects, the invention 
includes compositions comprising nucleic acid encoding 
all or part of a membrane-associated polypeptide of a 
5 mycobacterium and the membrane-associated polypeptide 
encoded by said DNA. The membrane-associated 

polypeptide is characterized by the ability to detect 
an immune response to pathogenic mycobacteria or the 
mycobacteria from which the membrane associated 
10 polypeptide or part thereof is derived. Such 
mycobacteria include M. bovis , M. tuberculosis f M. 
leprae , M. af ricanum and M. microti . M. avium , M. 
intracellular and M. scrofulaceum and M. bovis BCG. 

A particular mycobacterial membrane-associated 
15 polypeptide is a 79 kD ion-motive ATPase. Extra- 
cellular, intra-cellular and transmembrane domains are 
identified in this mycobacterial membrane-associated 
polypeptide based upon its DNA and deduced amino acid 
sequence. 

20 The invention also includes vaccines utilizing all or 
part of a membrane-associated mycobacterial 
polypeptide or an expressible form of a nucleic acid 
encoding it. The invention also includes 

mycrobacterial promoter sequences capable of directing 

25 gene expression in mycobacteria as well as in other 
microorganisms such as E. coli . Such promoters are 
from mycobacterial genes encoding membrane-associated 
ATPases. A preferred promoter is that of the gene 
encoding the M. bovis BCG 79 kD membrane-associated 

3 0 polypeptide. This promoter sequence is especially 
useful to express genes of interest in mycobacteria. 
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Brief Description of the Drawings 

Figure 1 illustrates the results of immunoscreening of 
recombinant colonies carrying M. bovis BCG DNA (panel 
A) and M. tuberculosis H37Rv DNA (panel B) , using sera 
5 from TB patients in which the presence of M. bovis BCG 
antigens and M. tuberculosis H37Rv antigens capable of 
reacting with the antisera is indicated by a 
qualitative signal. 

Figure 2 shows the comparison of restriction site maps 
10 of recombinant clones carrying BCG DNA identified using 
the immunoscreening assay described herein (panel B) 
with the restriction site maps of five immunodominant 
antigens of M. tuberculosis and M. bovis BCG genomic 
DNAs , respectively, (Husson and Young, Proc. Natl. 
15 Acad. Sci. , U.S.A., 84:1679-1683 (1987); Shinnick et 
al., Infect. Immun. . 55:1718-1721 (1987) (panel A)). 
Restriction maps in each panel have been drawn to the 
same scale (indicated at the top) , and restriction 
sites are indicated above the restriction maps. The 

2 0 dotted line in panel A represents the noh-mycobacterial 

DNA. Restriction enzymes: B, BamHI , E, EcoRI, G, 
Bglll, K, Kpnl, p, Pvul, X, Xhol, H, Hindi , U, PvuII, 
Ps, PstI, Hi, Hindlll. In panel A, A is Sail and S is 
Sacl. In panel B, S is Sail. 

25 Figure 3 illustrates the results of Western blot 
analysis of the sonicated supernate of recombinant 
clone pMBB51A which carries a BCG DNA insert identified 
following immunoscreening of the recombinant colonies. 
The top panel shows reactivity of MBB51A (lane 2) and 

3 0 E. coli (lane 1) with sera from TB patients. The 

bottom panel (part A) shows reactivity of MBB51A (lanes 
1 and 2) and E. coli (lane 3) with anti-H37Rv sera 
raised in rabbits. Part B shows reactivity of MBB51A 
(lanes 1 and 2) and E. coli (lane 3) with the second 
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antibody alone. Arrows indicate the position of the 90 
kD immuno reactive BCG protein expressed by the 
recombinant MBB51A, which was absent in the negative 
control . 

5 Figure 4 illustrates the nucleotide sequence (Seq. ID 
No.: 1) of clone pMBB51A 3.25 kb insert DNA containing 
the M. bovis BCG immunoreactive MBB51A gene encoding an 
ion-motive ATPase, with a deduced molecular weight of 
79 kD. The deduced amino acid sequence (Seq. ID 
10 No.: 2) is shown below the nucleotide sequence. 
Upstream promoter elements are underlined. 
Transcription termination region is indicated by 
inverted arrows. 5' and 3' flanking regions are also 
shown . 

15 Figure 5 illustrates a schematic model derived for the 
79 kD protein encoded by pMBB51A which represents an 
ion-motive ATPase of BCG. The model considers only the 
structural and functional features that are prominent 
in the other ion-motive ATPase homologs of 

2 0 transmembrane domains of the protein. Functionally, 
important amino acid residues are indicated (P) , 
proline at position 400; (D) , aspartic acid at position 
443; (G) , glycine at position 521; and (A), -alanine at 
position 646. Numbers indicate amino acid residues 

25 broadly defining the limits of the transmembrane 
domains. 

Figure 6 illustrates the results of Southern blot 
hybridization of BamHI digest of genomic DNAs from EJ. 
bovis BCG (lane 6) , M. tuberculosis H37Rv (lane 5) , J4. 
30 smeqmatis (lane 4) and M. vaccae (lane 3 using pMMB51A 
DNA insert (lane 8) as probe. Panel A shows ethidium 
bromide stained gel and panel B shows the results of 
Southern blot hybridization. 
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Detailed Description of the Invention 

As used herein, a "membrane-associated polypeptide" of 
a mycobacterium is defined as any Mycobacterial 
membrane-associated polypeptide which is capable of 
5 detecting an immune response against the wild-type 
mycobacterium containing the membrane-associated 
polypeptide. However, based upon the observed cross- 
reactivity of the 79 kD membrane-associated polypeptide 
of an M. bovis BCG with pooled anti-sera from patients 

10 afflicted with tuberculosis and the cross-hybridization 
as between the DNA encoding the 79 kD membrane- 
associated polypeptide and the DNA of M. tuberculosis 
H37Rv, the membrane-associated polypeptide of the 
invention is not limited to that identified herein from 

15 M. bovis BCG. Rather, it encompasses not only homologs 
to the 79 kD ion-motive ATPase but also any and all 
membrane-associated polypeptides of a mycobacterium 
that can be used to detect an immune response by the 
same or a different mycobacteria in which the membrane- 

20 associated polypeptide is normally found. 

As used herein, "nucleic acid" includes DNA or RNA as 
well as modified nucleic acid wherein a detectable 
label has been incorporated or wherein various 
modifications have been made to enhance stability, 

25 e.g., incorporation of phosphorothioate linkages in the 
phosphoribose backbone, etc. Such nucleic acid also 
includes sequences encoding the anti-sense sequence of 
the DNA encoding the membrane-associated polypeptide 
such that the now well-known anti-sense technology can 

30 be used to modulate expression of such membrane- 
associated polypeptides. 

In some aspects of the invention, the nucleic acid 
sequence encoding all or part of a membrane-associated 
polypeptide of the mycobacterium is used as a vaccine. 
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When so-used the nucleic acid is generally an 
"expressible nucleic acid" that contains all necessary 
expression regulation sequences to control 
transcription and translation of the nucleic acid in a 
5 designated host system. In some vaccine embodiments, 
the DNA encodes a chimeric polypeptide containing at 
least one transmembrane domain of the membrane- 
associated polypeptide and an "immunogenic 
polypeptide". The transmembrane domain is used to 

10 display the immunogenic polypeptide on the surface of 
a particular host organism such as an attenuated live 
vaccine. When the membrane-associated polypeptide 
includes more than one transmembrane region, one or 
more of the transmembrane regions can be used with an 

15 immunogenic polypeptide. Thus, for example, the 79 kD 
ion-motive ATPase as shown in Figure 5 has at least 
three extracellular domains into which an immunogenic 
polypeptide can be engineered by well-known methods 
involving recombinant DNA technology. Although it is 

20 preferred that more than one transmembrane region be 
used to display an immunogenic polypeptide, one skilled 
in the art can readily vary the length of such a 
membrane-associated polypeptide to maximize an 
immunogenic response or to minimize the amount of 

25 membrane-associated polypeptide used - in such 
applications. 

As used herein, "immunogenic polypeptide" comprises all 
or part of any polypeptide which can potentially be 
utilized in a vaccine or diagnostic application. Thus, 

30 the immunogenic polypeptide can comprise heterologous 
immunogens, i.e., immunogens from non-mycobacterial 
sources, e.g. , Salmonella or Shigella or from different 
mycobacteria from which the membrane-associated 
polypeptide is derived, e.g. , immunogens from 

35 Mycobacterium tuberculosis fused to a membrane- 
associated polypeptide from M. bovis BCG. However, in 
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some instances homologous immunogens can be used. For 
example, each of the extracellular domains as set forth 
in Figure 5 herein can be combined and displayed by 
combination with one or more of the transmembrane 
5 domains from the membrane-associated polypeptide 
normally containing them- Alternatively, the 

intercellular domains can be displayed extracellularly 
using appropriate transmembrane regions from the same 
molecule. 

10 In an alternate vaccine embodiment, all' or part of the 
membrane-associated polypeptide of mcobacteria, rather 
than the DNA encoding, is used as part of a vaccine. 
Such proteinaceous vaccines are formulated with well- 
known adjuvants and administered following well- 

15 established protocols known to those skilled in the 
art. 

In still other embodiments, the nucleic acid encoding 
the membrane-associated polypeptide of the invention 
can be used as a diagnostic for detecting infection 

20 based upon hybridization with wild-type genes contained 
by the infectious mycobacterium. Such detection can 
comprise direct hybridization of DNA extracted from an 
appropriate diagnostic sample or PCR amplification 
using the nucleotide sequence of the nucleic acid 

25 encoding the membrane-associated polypeptide of the 
invention to prime amplification. If PCR amplification 
is primed in a conserved region the presence of 
mycobacteria in a diagnostic sample can be determined. 
If primed in a non-conserved region which is species 

30 specific the diagnostic assay determined the specific 
mycobacterium causing an infection. 



In addition, the membrane-associated polypeptide of the 
invention can also be used to detect the presence of 
antibodies in the sera of patients potentially infected 
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with mycobacteria. Such detection systems include 
radioimmunoassays and various modifications thereof 
which are well-know to those skilled in the art. In 
addition, the membrane-associated polypeptide of the 
5 invention can be used to detect the presence of a cell- 
mediated immune response in a biological sample. Such 
assay systems are also well-known to those skilled in 
the art and generally involve the clonal expansion of 
a sub-population of T cells responding to stimuli from 
10 the membrane-associated polypeptide. When so-used, the 
humoral and/or cell-mediated response of a patient can 
be determined and monitored over the course of the 
disease. 

Recombinant clones encoding immunogenic protein 

15 antigens of M. bovis BCG have been isolated from a 
genomic library of M. bovis BCG DNA. In particular, 
DNA fragments encoding four protein antigens of M. 
bovis BCG have been isolated by probing a pBR3 2 2 
library of M. bovis BCG DNA with sera from TB patients, 

20 absorbed on E. coli . Restriction site maps of these 
four recombinant clones are different from those of the 
five immunodominant antigens of mycobacteria (Young, et 
al., Proc 1 Natl. Acad. Sci. . U.S.A., 82:2583-2587 
(1987) ; Husson and Young, Proc. Natl. Acad. Sci. . 

25 U.S.A., 84:1679-1683 (1987); Shinnick et al . , Infect. 
Immun . , 55:1718-1721 (1987)), thereby indicating that 
these cloned protein antigens are novel. One of the 
recombinant DNA clones encoded an immunoreactive 
protein with apparent molecular weight of 90 kD as 

3 0 determined by Western blot analysis. The complete 
nucleotide sequence of the insert DNA of this clone was 
determined. This clone was found to carry a 
mycobacterial promoter and a monocistronic ORF encoding 
a protein of 761 amino acids with a deduced molecular 

3 5 weight of 79 kD. This 79 kD protein had extensive 
homology with ion-motive ATPases of S. f aecalis (Solioz 
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et al., J. Biol, chem , 262:7358-7362 (1987)), E . coli 
(Hesse et al., Proc. Natl. Acad. Sci. . U.S.A., 81:4746- 
4750 (1984)) and several other organisms, and thus, 
represents an ion-motive ATPase or a putative K+ATPase 
5 of BCG. Using computer algorithms, this ion-motive 
ATPase was determined to be a membrane protein and has 
a homologue in M. tuberculosis H37Rv, which is 
pathogenic in humans, but not in M. vaccae and J4/ 
smecrmatis , which are non-pathogenic. As a result, 

10 novel BCG immunogens can be available which can be 
useful in the prevention, diagnosis and treatment of 
tuberculosis and other mycobacterial infections. They 
can be used, for example, in the development of highly 
specific serological tests for screening patients for 

15 individuals producing antibodies to M. tuberculosis , or 
those infected with M. tuberculosis , in the development 
of vaccines against the disease, and in the assessment 
of the efficacy of the treatment of infected 
individuals. 

2 0 Further, based on the nucleotide sequence of the 
pMBB51A insert DNA, appropriate oligonucleotide primers 
can be used for PCR amplification using as template M. 
bov is BCG or M. tuberculosis H37Rv DNA. Such a PCR 
amplification scheme can be thus useful for the 

25 detection of mycobacterial DNA in a given sample. 
Further, by a judicious choice of the primer design, 
such an amplification procedure can be adapted for 
taxonomic classification of mycobacterial DNAs. For 
example, using primers to flank a heavily conserved 

30 region such as the ATP-binding site, PCR amplification 
is common to all mycobacterial species, whereas using 
primers from non-conserved areas, amplification can be 
made species specific. 
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Example I 

Isolation and Characterization of Genes 
Encoding Iromogenic Protein Antigens 
of Myc ob a ct er ium bovis BCG 
5 and Mycobacterium tuberculosis H37R 

A. Construction of Recombinant DNA 

Libraries of M. bovis BCG DNA and 
Mycobacterium Tuberculosis H37Rv 

A recombinant DNA library of M. bovis BCG genomic DNA 

10 was constructed using pBR322 a high copy number plasmid 
vector (Bolivar, et al., Gene , 2:95-113 (1977)) with 
antibiotic markers (ampicillin and tetracycline) and 
several unique cloning sites. M. bovis BCG cells were 
harvested from a culture in late logarithmic phase of 

15 growth and high molecular weight DNA was isolated by 
the procedure of (Eisenach, et al., J. Mol . Biol . , 
179:125-142 (1986)) with slight modifications . BCG DNA 
was digested to completion with BamH I and shotgun 
cloning of these fragments into the BamH I site of 

20 pBR322 was performed- The genomic library was 
transformed into E. coli strain DHI and recombinants 
were scored on the basis of ampicillin resistance and 
tetracycline sensitivity- The aim of this approach 
was to generate restriction fragments of a broad size 

25 range so as not to restrict the library to DNA 
fragments of any particular size range. This cloning 
strategy also ensured to a large extent that any 
recombinants selected for expression of mycobacterial 
antigens should be likely to drive expression from a 

3 0 mycobacterial promoter rather than the Tet promoter of 
pBR3 22. 

The BCG library constructed in this manner contained 
2051 clones of BCG origin. In an analogous manner, a 
genomic library of Mycobacterium tuberculosis H37Rv DNA 
35 was constructed and 1100 clones obtained. 
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The BCG DNA inserts ranged in size from 0.9 to 9.5 kb. 
The average size of the mycobacteria DNA fragments 
inserted in pBR322 was estimated to be about 4 kb. 
Given the genome size of BCG to be 4.5 x 10 3 kb 
5 (Bradley, S. G. , J. Bacteriol . . 113:645-651 (1973); 
Imaeda, et al. f Int. J. Syst. Bacteriol. , 32, 456-458 
(1982)), about 1000 clones of this average insert size 
would represent comprehensively the entire genome of' 
the microorganism. 

10 B. Isolation of Recombinant DNA Clones Encoding 
BCG Mycobacterium bovis BCG and Mycobacterium 
tuberculosis H37Rv Protein Antigens 

In order to identify recombinants expressing 
mycobacterial antigens, a colony immunoscreening assay 

15 (CIA) to screen recombinant colonies with appropriate 
antisera, was established. Sera obtained from 20 
patients newly diagnosed with active pulmonary 
tuberculosis were pooled for use in immunoscreening. 
None of the patients had received treatment for 

2 0 tuberculosis prior to this study and their sputa were 
positive for acid fast bacteria in all cases. Pooled 
sera were absorbed on a E. coli sonicate overnight at 
4°C, to eliminate antibodies cross-reactive to E. coli 
antigens, thereby improving signal to noise ratio 

25 during the immunoscreening. 

Individual recombinant colonies were grown overnight on 
nitrocellulose membranes and immunoscreening was 
carried out as described with slight modifications. 
The colonies were lysed in chloroform vapor to release 
30 the cloned mycobacterial antigens, immobilized on the 
nitrocellulose paper. The immobilized antigens were 
reacted with TB sera and binding of the antibody was 
revealed by standard procedures using a horseradish 
peroxidase-protein A detection system. The signals 
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obtained with the recombinant clones were compared with 
that obtained in case of E. coli colonies harbouring 
pBR322 vector alone, which served as the negative 
control, to assess the signal to noise ratio. Further, 
to ascertain whether the immunoreactivity of the 
recombinant clones was due to anti-mycobacterial 
antibodies or due to a reaction with normal serum- 
components, another CIA of the selected recombinants 
was performed using TB sera and normal human sera NHS 
which had been absorbed on E. coli in a manner 
analogous to that described earlier for TB sera. Only 
those clones reacting selectively with TB sera and not 
with NHS, were considered to be unambiguously 
suggestive of the presence of mycobacterial antigens. 
The use of this immunoscreening approach to identify 
recombinant colonies carrying mycobacterial DNA inserts 
capable of expressing mycobacterial antigens is 
described below: 

Figure 1 shows the result of immunoscreening of 
recombinant colonies carrying M. bovis BCG DNA (panel 
A) or M. tuberculosis H37 Rv DNA (panel B) using sera 
from TB patients. The colonies were grown on 
nitrocellulose paper' overnight, lysed to release the 
cloned mycobacterial antigen and allowed to react with 
the antibodies. The presence of mycobacterial antigen 
is indicated by a qualitative signal in the recombinant 
clones which is absent in the negative control 
comprising colonies harbouring pBR3 22 vector alone. A 
similar assay was repeated with normal human serum to 
ascertain the specificity of the cloned mycobacterial 
antigens. 51 recombinant colonies carrying M. bovis 
BCG DNA inserts and 45 recombinant colonies carrying J*. 
tuberculosis H37Rv DNA inserts were screened by the 
above procedure; 14 clones of BCG origin (panel A) and 
2 clones of H37Rv origin (panel B) exhibited distinct 
strong signals indicating the immunoreactivity of these 
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clones (Fig. 1) . All these clones were also tested for 
immunoreactivity with NHS. However, with the exception 
of 3 clones which showed a slight reactivity to NHS, 
none of the clones reacted with NHS, thereby indicating 
5 that these expressed mycobacterial antigens reacted 
selectively with TB sera. Thus, this procedure 
resulted in the forthright identification of 
recombinant clones encoding mycobacterial antigens. 
This strategy can be generally applicable to 
10 mycobacterial gene banks prepared in plasmid or cosmid 
vectors to identify genes which are expressed in E. 
coli at least to the limit detectable by the 
immunoassay. 

C. Restriction Mapping of Immunoreactive 
15 Mycobacterium bovis BCG DNA Recombinants 

The insert DNAs of four of the immunoreactive BCG 
recombinant DNA clones isolated using the TB sera were 
mapped with restriction endonucleases . Figure 2, panel 
B, shows the genomic DNA restriction site maps deduced 

2 0 for the cloned BCG DNA in four recombinants, in which, 

A represents Sal I, B, BamH I, E , EcoR I, G, Bgl II, K, 
Kpn I, P, Pvu I, S, Sac I, X, Xho I. These restriction 
site maps were then compared with those constructed 
previously for the five immunodominant antigens of 
25 tuberculosis/M. bovis BCG (Young, et al., Proc. Natl. 
Acad. Sci. . U.S.A., 82:2583-2587 (1985); Husson, et 
al., Proc. Natl. Acad. Sci. . 84:1679-1683 (1987); 
Shinnick, et al., Infect. Immun. . 55, 7:1718-1721 
(1987)) (Figure 2, panel A). Since the restriction 

3 0 site maps shown in panels A and B have been drawn to 

the same scale, the differences between the two are 
apparent. There are no regions of similarity between 
the restriction site maps of immunoreactive BCG 
recombinant clones and those of the previously 
3 5 characterized immunodominant antigens of M. 
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tuberculosis/ M. bovis BCG. Therefore, one can conclude 
that the cloned BCG DNA inserts in the four 
recombinants are novel. 

Example II 

5 Isolation and Characterization 

of a Gene Encoding a BCG Ion-motive ATPase 

A. Identification of a Novel BCG Antigen 

One of the four immunoreactive BCG clones, pMBB51A, 
revealed the presence of a protein of Mr 90 kD, on 

10 Western blot analysis using TB sera as well as anti- 
H37Rv polyclonal antiserum raised in rabbits (Figure 
3). Similar Western blot analysis of pMBB51A with a 
pool of a few anti-mycobacterial monoclonal antibodies 
(TB 23, TB 71, TB 72, TB 68, TB 78; Engers et al., 

15 Infec. Immun. , 48:603-605 (1985)) or with normal human 
sera did not reveal this immunoreactive protein of 90 
kD. This confirms that pMBB51A encodes a BCG antigen 
which is different from those identified previously in 
BCG, thereby making it a novel antigen. 

2 0 B. Determination of the 

Nucleotide Sequence of pMBB51A 

In order to further characterize this novel BCG 
antigen, pMBB5lA DNA insert was subjected to nucleotide 
sequencing. The BamH I-BamH I insert carried in 
25 pMBB51A was mapped for additional restriction enzyme 
cleavage sites. It was determined that there were at 
a minimum a single Pst I site and 3 Sal I sites in this 
sequence. Overlapping fragments derived from single 
and double digests of Sal I, BamH I and Sal I, BamH I 

3 0 and Pst I, and Pst I and Sal I, were subcloned into 

M13mpl8 and M13mpl9 vectors, in preparation for DNA 
sequence analysis. DNA sequencing was then carried out 
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using commercially available kits such as the Seguenase 
system and the T7 system from Pharmacia. 
Oligonulceotides derived from the determined sequence 
were synthesized and used as primers to complete the 
5 sequence of the larger inserts. Several areas of 
compression were encountered during the sequencing and 
these were resolved by using dITP in the sequencing 
reactions, and by changing the reaction conditions. 
The complete nucleotide sequence of the pMBBSIA insert 
10 DNA was determined by sequencing both the strands using 
dGTP as well as dITP. The DNA sequence of the pMBB5lA 
insert was determined to be 3.25 kb long with a GC 
content of 67 . 1% and is shown in Figure 4 . 

The determination of the DNA sequence of the 3.25 kb 
15 insert of clone pMBB51A (Figure 4) permitted the 
elucidation of the amino acid sequence of the 9 0 kD BCG 
antigen. In Figure 4, nucleotides are numbered from 
the left end of the pMBB51A insert DNA. 

A search of pMBB51A insert DNA sequence for possible 

2 0 ORFs in all three reading frames revealed the longest 

ORF of 2286 bp encoding a polypeptide of 761 amino 
acids on one of the strands. The other strand was 
found to have a smaller URF of 1047 bp capable of 
encoding a polypeptide of 349 amino acids. The longest 
25 ORF encoding a 761 amino acid long protein corresponded 
to a deduced molecular weight of 79 kD which came 
closest to the immunoreactive BCG protein with apparent 
molecular weight of 90 kD, seen on the Western blot. 
The deduced amino acid sequence for this protein is 

3 0 given below the nucleotide sequence in Figure 4. 

The location of this ORF on the pMBB51A insert DNA was 
such that there were long stretches of flanking DNA 
sequences, devoid of any meaningful ORFs, present on 
either side. This precluded the expression of this ORF 



using commercially available kits such as the Seguenase 
system and the T7 system from Pharmacia. 
Oligonulceotides derived from the determined sequence 
were synthesized and used as primers to complete the 
sequence of the larger inserts. Several areas of 
compression were encountered during the sequencing and 
these were resolved by using dITP in the sequencing 
reactions, and by changing the reaction conditions. 
The complete nucleotide sequence of the pMBB51A insert 
DNA was determined by sequencing both the strands using 
dGTP as well as dITP. The DNA sequence of the pMBB5lA 
insert was determined to be 3.25 kb long with a GC 
content of 67.1% and is shown in Figure 4. 

The determination of the DNA sequence of the 3.25 kb 
insert of clone pMBB51A (Figure 4) permitted the 
elucidation of the amino acid sequence of the 90 kD BCG 
antigen. In Figure 4, nucleotides are numbered from 
the left end of the pMBB51A insert DNA. 

A search of pMBB51A insert DNA sequence for possible 
ORFs in all three reading frames revealed the longest 
ORF of 2286 bp encoding a polypeptide of 761 amino 
acids on one of the strands. The other strand was 
found to have a smaller URF of 1047 bp capable of 
encoding a polypeptide of 349 amino acids. The longest 
ORF encoding a 761 amino acid long protein corresponded 
to a deduced molecular weight of 79 kD which came 
closest to the immunoreactive BCG protein with apparent 
molecular weight of 90 kD, seen on the Western blot. 
The deduced amino acid sequence for this protein is 
given below the nucleotide sequence in Figure 4. 

The location of this ORF on the pMBB51A insert DNA was 
such that there were long stretches of flanking DNA 
sequences, devoid of any meaningful ORFs, present on 
either side. This precluded the expression of this ORF 
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from the pBR3 22 Tet gene promoter and instead suggested - 
that this ORF was being expressed from its own promoter 
in pMBB51A. This also suggested that E. coli may 
correctly utilize the M. bovis BCG transcription and 
5 translation start and stop sites in this gene. 

Immediately upstream of the ORF, regulatory sequences 
closely matching the -35, -10 and Shine-Dalgarno' 
sequences of E. coli , (Rosenberg, et al., Annul. Rev. 
Genet. . 13:319-353 (1979)) were identified. The 

10 spacing between these three regulatory motifs was also 
very well conserved. Although the other mycobacterial 
promoters sequenced (Dale, et al.. Molecular Biology of 
the Mycobacteria , chap. 8, 173-198 (1990)) show some 
differences from the E. coli consensus sequences in all 

15 the three regions -35, -10 and SD, the regulatory 
elements of pMBB51A DNA showed a maximum degree of 
sequence identity with E. coli in the -3 5 and SD 
sequence elements with a single mismatch in each 
element, and about 50% sequence identity in the Pribnow 

2 0 box. All the above features clearly indicated that 

this region is the promoter region for the 
mycobacterial gene contained in pMBB5lA. The extent of 
similarity between this BCG promoter sequence and a 
typical E. ' coli promoter is remarkable and explains the 
25 functional activity of this promoter, unlike many other 
mycobacterial promoters, in E. coli . The translation 
initiation codon in this ORF was ATG at position 508 
while a single translation termination codon TGA was 
identified at position 2790. Potential transcription 

3 0 termination structures capable of forming stem and loop 

conformations were identified in the region 3' to this 
ORF. The pMBB5 1 A ORF thus represented a monocistronic 
gene rather than an operon. The promoter region of 
MBB51A gene is capable of directing gene expression in 
35 E. coli as well as in mycobacteria. This promoter 
sequence is useful for directing expression of 
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mycobacterial genes in E. coli. Further , this promoter 
sequence can also be used to express homologous and/or 
heterologous genes in a mycobacterium, thus providing 
a key element for the development of gene expression 
5 systems in mycobacteria. 

In order to derive information about the possible 
biological function of the MBB51A protein, the amin'o 
acid sequence of this protein was used to search for 
homology against available sequences in the PIR Protein 

10 Database Release 20 (Table I) and a Genebank Nucleic 
Acid Database (Table II) using the Fast A suite of 
programmes written by (Lipman and Pearson, Proc. Natl. 
Acad. Sci. . USA, 85:2 (1988)). The MBB51A protein 
sequence exhibited homology to a family of ion-motive 

15 ATPases from different organisms, ranging from bacteria 
to mammals. The 13 best scores from a search with 
ktuple 2 are shown in the upper panel of Table I and 10 
best scores from a search with ktuple 1 are shown in 
the lower panel. In each case, MBB51A protein 

20 exhibited maximum homology (75.9% homology in a 593 
amino acid overlap with 31.9% identity to a K+ 
transporting ATPase of S. f aecalis (Solioz et al . , 
1987) . The next best homology was observed with the B- 
chain of K+ transporting ATPase of E. coli (Hesse, et 

25 al., Proc. Natl. Acad. Sci. , U.S.A., 81:4746-4750 
(1984)) (68.8% homology in a 397 amino acid overlap 
with 24.2% identity) . A lesser extent of homology was 
also seen with H+, Ca++ and Na+-ATPases from different 
organisms. The results of homology search thus 

30 indicated that MBB51A protein is an ion-motive ATPase 
of M. bovis BCG and is closely related to the other 
bacterial ion-motive ATPases. This is the first report 
of the cloning and identification of such an ATPase in 
mycobacteria. The BCG ion-motive ATPase showed 

35 homologies with other ion-motive ATPases with 
overlapping regions ranging in size from 59 3 amino 
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acids in case of S. f aecalis to 82 amino acids as in 
case of L. donovani , (Meade, et al., Mol. Cell Biol. , 
7, 3937-3946 (1987)), though most of the regions of 
sequence identity or conservation were localized in the 
5 C- terminal half of the MBB51A protein. Further, a 
region of 30 amino acids in the C-terminal half of 
MBB51A protein was found to be shared with most of 
these ATPases, thereby suggesting the functional 
importance of this region. Detailed alignment of 
10 MBB51A protein with the K+ ATPases of S. faecal is and 
E. coli also indicated that several residues were 
conserved between the three ATPases, including the ones 
that are invariant in all ATPases from bacteria to man. 



TABLE I 



15 RESULTS OF HOMOLOGY SEARCH OF MBB51A 

AMINO ACID SEQUENCE AGAINST PIR PROTEIN DATABASE 



LOCUS 


SHORT DEFINITION 




opt 


>A29576 


Potassium - transporting ATPase Streptococcus 


547 


7 92 


>PWECBK 


Potassium - transporting ATPase, f chain - E.coli 


314 


270 


>A25939 


Proton - transporting ATPase - Neurospora 


168 


186 


>A25823 


Proton - transporting ATPase - Yeast 


166 




>PWRBFC 


Calcium,- transporting ATPase, fast twitch skele 


152 


158 


>PWRBSC 


Calcium - transporting ATPase, slow twitch skele 


135 


157 


>A25344 


Potassium - transporting ATPase - Rat 


78 


155 


>RDEBHA 


Mercuric reductase -Shigella flexneri plasmid 




142 


>RDPSHA 


Mercuric reductase (transposon TnSOl) 


74 


124 


>RGPSHA 


Mercuric resistance operon regulatory p 


79 


109 


>A2A639 


Sodium/potassium-transporting ATPase, alpha 


92 


82 


>A24414 


Sodium/potassium-transporting ATPase, alpha 


92 


82 


>B24862 


Sodium/potassium-transporting ATPase, beta 


83 • 


82 



The PTR protein data base (2378611 residues in 9124 sequences) was 
scanned with the FASTA program. The mean of the original initial 
score was 27.2 with a standard deviation of 6.9. Initial scores 
35 (initn) higher than 75.6 are 6 standard deviations above the 
average, a level of significance that usually indicates biological 
relatedness. Optimization (opt) generally will improve the 
initial score of related proteins by introducing gaps in the 



sequence. Unrelated sequences usually do not have their scores 
improved by optimization. 



ktuple : 1 

>A29576 potassium-transporting ATPase - Streptococcus 744 

>FWECBK potassium-transporting ATPase. 6 chain - Esche 386 

>A25939 Proton -transporting ATPase - Neurospora crassa 310 

>A25823 proton-transporting ATPase -Yeast (Saccharomy) 317 

>B24639 Sodium/potassium- transporting ATPase, alpha <+ 158 

>A24639 Sodium/potassium-transporting ATPase, alpha ch 175 

>C24639 Sodium/potassiura-transporting ATPase, alpha <II 192 

>PWRBFC Calcium-transporting ATPase, fast twitch skele 240 

>FWSHNA Sodium/potassium-transporting ATPase, alpha skele 214 

>A24414 Sodium/potassium-transporting ATPase, alpha chain 214 



RESULTS OF HOMOLOGY SEARCH OF MBB51A AMINO ACID SEQUENCE 
AGAINST GENBANK NUCLEIC ACID SEQUENCE DATABASE 



ktuple : 2 

LOCUS 

>STRATPK 

>ECOKDPABC 

>YSPPMA1A 

>NEUATPASE 

>NEUATPPM 

>YSCPMA1 

>M17889 

>M12898 

>RABATPAC 

>NR1MER 



SHORT DEFINITION 

S.faecalis K+ ATPase, complete cds. 

E.coli kdpABC operon coding for Kdp-ATpase 

S.pombe H+ ATPase, complete cds. 

H. crassa plasma membrane ATPase, complete 

Neurospora crassa plasma membrane H+ ATPase 

Yeast PMA1 for plasma membrane ATPase 

Figure 2. N of L.donovani ATPase and 

Rabbit fast twitch skeletal muscle Ca++ ATPas 

Rabbit Ca + Mg dependent Ca++ ATPase mRNA, cc 

Plasmid NR1 mercury resistance (iter) operon. 



>STRATPK S.faecalis K+ ATPase gene, complete cds. 

>SYNCATPSB Cyanobacterium Synechococcus 6301 DKA for AT 

>ECOKDPABC E.coli kdpABC operon coding for Kdp-ATPase p 

>YSPPMA1A S.pombe H+ ATPase gene, complete cds. 

>NEUATPASE F. crassa plasma membrane ATPase gene, comple 

>NEUATPPM Keurospora crassa plasma membrane H+ ATPase 

>YSCPMA1 Yeast PMA1 gene for plasma membrane ATPase 

>JO4004 Leishmania donovani. cation transporting ATP 

>M17889 Figure 2. Nucleotide seguence of L.donovani 

>RATATPA2 Rat Na+,K+ ATPase alpha (+) isoform catalytic 



The KdpB protein of E. coli and possibly the S. 
faecalis K+ ATPase are members of ElE2-ATPases which 
are known to form an aspartyl phosphate intermediate, 
with cyclic transformation of the enzyme between 



phosphorylated and dephosphorylated species. By analogy 
to other ATPases, the phosphorylated Asp residue (D) 
(Furst, et al., .T. Biol. Chem. . 260:50-52 (1985)) was 
identified at position 443 in the MBB51A ATPase. This 
residue is the first of a pentapeptide sequence DKTGT 
that has been conserved in ATPases from bacteria to 
man, and must form an essential element of the' 
catalytic site. Similarly, proline (P) at position 400 
in MBB51A ATPase was found to be an invariant amino 
acid in other ATPases and is predicted to be located in 
a membrane spanning domain. Such membrane buried 
proline residues have been hypothesized to be required 
for the reversible conformational changes necessary for 
the regulation of a transport channel (Brandl, et al., 
Proc. Natl . Acad. Sci. , U.S.A., 83:917-921 (1986)). In 
addition, other sequence motifs believed to be 
functionally important in other ion-motive ATPases were 
also found to be conserved in the MBB51A ATPase. These 
include a Gly (G) (Farley and Faller, J. Biol. Chem., 
260:3899-3901 (1985)) at position 521 and Ala (A) 
(Ohta, et al. , Proc. Natl. Acad. Sci. , U.S.A., 83:2071- 
2075 (1986)) at position 646, and are shown in 
Figure 5. 

Since the MBB51A ATPase was homologous to membrane 
associated ATPases, characterization of the membrane 
associated helices in MBB51A protein was performed by 
computer algorithms. Using a hydropathy profile (Rao, 
et al., Biochem. Bioohvs. Acta. , 869:197-214 (1986)), 
seven transmembrane domains in the MBB51A protein were 
identified and are shown in Table III and Figure 5. 
Nearly the same transmembrane domains were also 
identified using the hydrophobic moment plot (Eisenberg 
et al., J. Mol. Biol. , 179:125-142 (1984)) and are also 
shown in Table III and Figure 5. The average size of 
a transmembrane domain is around 21 residues, because 
21 residues coil into an a-helix approximately the 



thickness of the apolar position of a lipid bilayer (32 
A) . This size of a transmembrane domain is, however, 
flexible within the range of a few amino acids, as 
determined by the functional properties of a given 
membrane-associated protein. The transmembrane domains 
identified in MBB51A protein, range in size from 20-37 
residues. The first six transmembrane domains span the 
membrane only once, as indicated by both the hydropathy 
profile and the hydrophobic moment plot. The seventh 
transmembrane domain may traverse the membrane twice. 
These features along with the membrane buried proline 
(P) at position 400, are in accordance with the channel 
transport functions of ion-motive ATPases, involving a 
reversible change in the conformation of these 
proteins. Such transmembrane domains further define 
the intracellular and extracellular domains of this 
molecule. See Figure 5. 





Table III 




Transmembrane 
Domain in Fig. 5 


Eisenberg 
Method 


Rao & Argos 
Method 


1 


102 - 122 


98 - 125 


2 


129 - 149 


127 - 147 


3 


164 - 184 


164 - 185 


4 


199 - 219 


198 - 220 


5' 


361 - 381 


360 - 382 


6 


387 - 407 


387 - 419 


7 


703 - 723 


695 - 732 



The hydropathy profile of MBB51A protein was nearly 
superimposable over that of S. faecalis K+ ATPase, even 
though the MBB51A ATPase has at the N-terminus, 154 
extra amino acids, which were absent in S. faecalis. 
This clearly puts in evidence the strong evolutionary 
conservation of the broad domain structure between 
these two proteins, making it more likely for the two 
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proteins to have a similar three dimensional structural 
organization. 



Based on the hydropathy profile and secondary structure 
predictions, a schematic model of the MBB51A' ATPase is 
5 presented in Figure 5. This model comprises at least 
seven transmembrane domains which span the membrane, 
once are indicated along with the respective amino acid 
positions in Figure 5. This model further defines 
extracellular and intracellular domains of the MBB51A 

10 protein. Many of the residues which have been shown to 
be functionally important in other ion-motive ATPases 
and are also conserved in the MBB51A protein, are also 
shown. Of these, proline (P) at position 400 is 
membrane-buried whereas as aspartic acid(D) at 443, 

15 glycine (G) at 521 and alanine (A) at 646, face the 
cytoplasm. 

In order to determine whether the gene encoding MBB51A 
ion-motive ATPase is present in other mycobacterial 
strains related or unrelated to BCG, like the virulent 

20 strain M. tuberculosis H37Rv and other non-tuberculous, 
non-pathogenic mycobacteria like M. vaccae and M. 
smeqroatis ,. Southern blot hybridization with genomic DNA 
from the above species was performed, using as probe 
BCG insert DNA from pMBBSIA. As shown in Figure 6, DNA 

25 hybridizable with the pMBBSIA insert DNA was also 
present in M. tuberculosis H37Rv DNA but not in M. 
smegmatis and M. vaccae . This indicated that the M. 
tuberculosis H37Rv homologue of the pMBBSIA gene has a 
similar genetic organization as seen in M. bovis BCG 

3 0 DNA, and is present on a 3.25 kb BamH I fragment. 

The availability of novel Mycobacterium bovis BCG 
and/or Mycobacterium tuberculosis H37Rv antigens make 
it possible to address basic biochemical, 
immunological, diagnostic and therapeutic questions 



-29- 

still unanswered about tuberculosis and Mycobacterium 
tuberculosis . For example, Mycobacterium tuberculosis 
specific antigenic determinants can be used to develop 
simple and specific seroepidemiological tests to screen 
5 human populations. Such serological tests are highly 
specific because of the use of antigenic determinants 
determined by the approaches described above and known' 
to be unique to Mycobacterium tuberculosis H37Rv. Such 
serological tests are useful for early diagnosis of 
10 tuberculosis, thus permitting early treatment and 
limiting transmission of the disease from infected 
individuals to others. 

Resistance to tuberculosis is provided by cell mediated 
immunity. The antigens identified here can be further 
15 used to determine which segments of these antigens are 
recognized by Mycobacterium tuberculosis specific T- 
cells. A mixture of peptides recognized by helper T- 
cells provides a specific skin test antigen for use in 
assessing the immunological status of patients and 

2 0 their contacts. A mixture of such peptides is also 

useful in evaluating rapidly the immunological efficacy 
of candidate vaccines. In addition peptides recognized 
by Mycobacterium tuberculosis specific T-cells can be 
components of a vaccine against the disease. 

25 Knowledge of the complete nucleotide sequence of 
pMBB51A DNA insert provides a rich source of sequence 
information which can be used to design appropriate 
primers for PCR amplification of mycobacterial genomic 
DNA fragments. The ion-motive ATPase of BCG has areas 

30 of heavily conserved sequences (for, e.g., the ATP 
binding site) which are expected to be the same for all 
mycobacterial species and areas of sequence divergence 
(for, e.g., the N-terminal region) which are different 
in different mycobacterial species. Based on this 

3 5 knowledge primers can be designed either from the 
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conserved regions or from the diverged regions to 
identify whether in a given sample the target DNA is 
mycobacterial versus non-mycobacterial , and in case of 
mycobacterial DNA, which mycobacterial species the DNA 
5 belongs. 

Such amplification schemes are useful for the 
development of highly sensitive and specific PCR 
amplification based diagnostic procedures for 
mycobacteria. The observation that the 3.25kb pMBB5lA 

10 DNA insert is present in Mycobacterium tuberculosis 
H37Rv and Mycobacterium bovis BCG and is absent in 
avirulent Mycobacterium vaccae and Mycobac terium 
smeqmatis , which have bearing on other aspects of the 
biological differences between these species, manifest 

15 in terms of virulence, growth characteristics and 
metabolism. 

Recombinant vaccines can also be constructed by 
incorporating the DNA encoding all or part of the 
membrane-associated polypeptides of the invention into 

20 an appropriate vaccine vehicle. For example, all or 
part of the DNA encoding the 79kD Mycobacterium bovis 
BCG protein or a -portion of the protein can be 
incorporated into a vaccine vehicle capable of 
expressing the said DNA. Such a vaccine vehicle could 

2 5 be a virus for, e.g., vaccinia virus, etc., or a 
bacterium, e.g., mycobacteria, Salmonella, Vibrio, 
Bacillus, Yersinia, Bordetella, etc. to produce a 
vaccine capable of conferring long-lasting immunity on 
individuals to whom it is administered. 

30 A special feature of the 79kD BCG ion-motive ATPase is 
that it is a membrane bound antigen. Therefore, it can 
be used to link foreign DNA sequences encoding 
antigenic epitopes (B-cell epitopes or T-cell epitopes) 
of interest, with this gene or a portion of this gene 
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in a manner which causes the foreign epitope to be used 
as an immunogen. Such linkages can be engineered into 
extracellular or intracellular domains of MBB51A 
protein, or into a combination of both types of 
5 domains. Engineering of immunogenic foreign epitopes 
into MBB51A DNA is accomplished by standard recombinant 
DNA methods known to those skilled in the art. Some of, 
these methods involve use of unique restriction sites, 
in vitro mutagenesis and/or PCR-related methods. One 

10 such convenient method involves the use of a unique 
Ndel site at position 1090 in the MBB51A DNA where 
foreign DNA can be inserted. Grafting of epitopes on 
the cell surface induces rapid antibody response by 
virtue of the epitope being well-exposed on the 

15 bacterial cell, which in turn leads to direct 
activation of B cells. In addition, intracellular 
localization of an epitope induces B cell memory and a 
proficient T cell response. Examples of epitopes of 
interest known to be involved in the immune response to 

20 various pathogens include epitopes from E. coli LT 
toxin, foot and mouth disease virus, HIV, cholera 
toxin, etc. 

Thus, the 79 kD antigen is useful in the design of 
recombinant vaccines against different pathogens. Such 

25 vaccines comprise a recombinant vaccine vehicle capable 
of expressing all or part of the 79 kD membrane- 
associated protein of mycobacteria, into which foreign 
epitopes have been engineered, such that the foreign 
epitopes are expressed on the outer surface and/or on 

30 the inner side of the cell membrane, thereby rendering 
the foreign epitopes immunogenic. The vaccine vehicle 
for this purpose may be a cultivable mycobacterium for, 
e.g., BCG. In these applications, the BCG ion-motive 
ATPase gene can be borne on a mycobacterial shuttle 

3 5 vector or alternately the foreign DNA encoding 
antigenic epitopes of the immunogenic polypeptides can 
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be inserted into the mycobacterial genome via 
homologous recombination in the ion-motive ATPase gene 
or random integration. Such a process yields stable 
recombinant mycobacterial strains capable of expressing 
5 on their surface and/or in the cytoplasm antigenic 
sequences of interest, which can, for example, provide 
protection against a variety of infectious pathogens.. 
Targeting of recombinant antigens to the cell-wall is 
attractive not only because of the high immunogenic ity 

10 of mycobacterial cell-walls but, in addition, because 
of concerns with the introduction of a live vaccine in 
populations with a high prevalence of HIV 
seropositivity. Additionally, based on the MBB51A 
protein, a non-living but immunogenic recombinant cell 

15 surface subunit vaccine can also be developed to 
provide a useful alternative to live vaccines. 
Alternately, other bacterial, viral or protozoan 
vaccine vehicles could be transformed to generate such 
recombinant vaccines. Examples of potential vaccine 

2 0 vehicles include vaccinia virus, pox-viruses, 
Salmonella, Yerisinia, Vibrio, Bordetella, Bacillus, 
etc. 

Further, 'using such an approach, multivalent 
recombinant vaccines which allow simultaneous 
25 expression of multiple protective epitopes/antigens of 
different pathogens, could also be designed. 

Equivalents 

Those skilled in the art will recognize, or be able to 
ascertain, using no more than routine experimentation, 
3 0 many equivalents to the specific materials and 
components described specifically herein. Such 
equivalents are intended to be encompassed in the scope 
of the following claims. 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GGATCCCGCG GTCATCGATC GGGTCAAACA CCGCCTCGAC GGGTTCACGC TGGCGCCGCT 60 

GTCCACCGCC GCGGGAGGTG GTGGCCGGCA GCCACGCATC TACTACGGCA CCATCCTGAC 120 

CGGTGACCAA TACCTTCACT GCGAGCGCAC CCGCAACCGG CTGCACCACG AACTCGGCGG 180 

TATGGCCGTC GAAATGGAAG GCGGTGCGGT GGCGCAAATC TGCGCGTCCT TCGATATCCC 240 

ATGGCTGGTC ATTCGCGCGC TCTCCGATCT CGCCGGAGCC GATTCGGGGG TGGACTTCAA 300 

TCGGTTTGTC GGCGAGGTGG CGGCCAGTTC GGCCCGCGTT CTGCTGCGCT TGCTGCCGGT 360 

GTTGACGGCC TGTTGAAGAC GACTATCCGC CGGTGCGTTC ACCGCGTCAG GCGGCTTCGG 420 

TGAGGTGAGT AATTTGGTCA TTAACTTGGT CATGCCGCCG CCGATGTTGA GCGGAGGCCA 480 

CAGGTCGGCC GGAAGTGAGG AGCCACG ATG ACG GCG GCC GTG ACC GGT GAA 531 
Met Thr Ala Ala Val Thr Gly Glu 
1 5 

CAC CAC GCG AGT GTG CAG CGG ATA CAA CTC AGA ATC AGC GGG ATG TCG 579 
His His Ala Ser Val Gin Arg He Gin Leu Arg He Ser Gly Met Ser 
10 15 20 

TGC TCT GCG TGC GCC CAC CGT GTG GAA TCG ACC CTC AAC AAG CTG CCG 627 
Cys Ser Ala Cys Ala His Arg Val Glu Ser Thr Leu Asn Lys Leu Pro 
25 30 35 40 

GGG GTT CGG GCA GCT GTG AAC TTC GGC ACC CGG GTG GCA ACC ATC GAC 675 
Gly Val Arg Ala Ala Val Asn Phe Gly Thr Arg Val Ala Thr He Asp 
45 50 55 

ACC AGC GAG GCG GTC 'GAC GCT GCC GCG CTG TGC CAG GCG GTC CGC CGC 723 
Thr Ser Glu Ala Val Asp Ala Ala Ala Leu Cys Gin Ala Val Arg Arg 
60 65. " 70 

GCG GGC TAT CAG GCC GAT CTG TGC ACG GAT GAC GGT CGG AGC GCG AGT 771 
Ala Gly Tyr Gin Ala Asp Leu Cys Thr Asp Asp Gly Arg Ser Ala Ser 
75 * 80 85 

GAT CCG GAC GCC GAC CAC GCT CGA CAG CTG CTG ATC CGG CTA GCG ATC 819 
Asp Pro Asp Ala Asp His Ala Arg Gin Leu Leu He Arg Leu Ala He 
90 95 100 

GCC GCC GTG CTG TTT GTG CCC GTG GCC GAT CTG TCG GTG ATG TTT GGG 867 
Ala Ala Val Leu Phe Val Pro Val Ala Asp Leu Ser Val Met Phe Gly 
105 110 115 120 

GTC GTG CCT GCC ACG CGC TTC ACC GGC TGG CAG TGG GTG CTA AGC GCG 915 
Val Val Pro Ala Thr Arg Phe Thr Gly Trp Gin Trp Val Leu Ser Ala 
125 130 135 
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CTG GCA CTG CCG GTC GTG ACC TGG GCG GCG TGG CCG TTT CAC CGC GTT 963 
Leu Ala Leu Pro Val Val Thr Trp Ala Ala Trp Pro Phe His Arg Val 
140 145 150 

GCG ATG CGC AAC GCC CGC CAC CAC GCC GCC TCC ATG GAG ACG CTA ATC 1011 
Ala Met Arg Asn Ala Arg His His Ala Ala Ser Met Glu Thr Leu lie 
155 160 165 

TCG GTC GGT ATC ACG GCC GCC ACG ATC TGG TCG CTG TAC ACC GTC TTC 1059 
Ser Val Gly He Thr Ala Ala Thr He Trp Ser Leu Tyr Thr Val Phe 
170 175 180 

GGC AAT CAC TCG CCC ATC GAG CGC AGC GGC ATA TGG CAG GCG CTG CTG 1107 
Gly Asn His Ser Pro He Glu Arg Ser Gly He Trp Gin Ala Leu Leu 
185 190 195 200 

GGA AGC GAT GCT ATT TAT TTC GAG GTC GCG GCG GGT GTC ACG GTG TTC 1155 
Gly Ser Asp Ala He Tyr Phe Glu Val Ala Ala Gly Val Thr Val Phe 
205 210 215 

GTG CTG GTG GGG CGG TAT TTC GAG GCG CGC GCC AAG TCG CAG GCG GGC 1203 
Val Leu Val Gly Arg Tyr Phe Glu Ala Arg Ala Lys Ser Gin Ala Gly 
220 225 230 

AGT GCG CTG AG A GCC TTG GCG GCG CTG AGC GCC AAG GAA GTA GCC GTC 1251 
Ser Ala Leu Arg Ala Leu Ala Ala Leu Ser Ala Lys Glu Val Ala Val 
235 240 245 

CTG CTA CCG GAT GGG TCG GAG ATG GTC ATC CCG GCC GAC GAA CTC AAA 1299 
Leu Leu Pro Asp Gly Ser Glu Met Val He Pro Ala Asp Glu Leu Lys 
250 255 260 

GAA CAG CAG CGC TTC GTG GTG CGT CCA GGG CAG ATA GTT GCC GCC GAC 1347 
Glu Gin Gin Arg Phe Val Val Arg Pro Gly Gin He Val Ala Ala Asp 
265 270 275 280 

GGC CTC GCC GTC GAC GGG TCC GCT GCG GTC GAC ATG AGC GCG ATG ACC 1395 
Gly Leu Ala Val Asp Gly Ser Ala Ala Val Asp Met Ser Ala Met Thr 
285 290 295 

GGC GAG GCC AAA CCG ACC CGG GTG CGT CCG GGG GGG CAG GTC ATC GGC 1443 
Gly Glu Ala Lys Pro Thr Arg Val Arg Pro Gly Gly Gin Val He Gly 
300 305 310 

GGC ACC ACA GTG CTT GAC GGC CGG CTG ATC GTG GAG GCG GCC GCG GTG 1491 
Gly Thr Thr Val Leu Asp Gly Arg Leu He Val Glu Ala Ala Ala Val 
315 320 325 

GGC GCC GAC ACC CAG TTC GCC GGA ATG GTC CGC CTC GTT GAG CAA GCG 1539 
Gly Ala Asp Thr Gin Phe Ala Gly Met Val Arg Leu Val Glu Gin Ala 
330 335 340 

CAG GCG CAA AAG GCC GAC GCA CAG CGA CTA GCC GAC CGG ATC TCC TCG 1587 
Gin Ala Gin Lys Ala Asp Ala Gin Arg Leu Ala Asp Arg He Ser Ser 
345 350 355 360 
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GTG TTT GTT CCC GCT GTG TTG GTT ATC GCG GCA CTA ACC GCA GCC GGA 
Val Phe Val Pro Ala Val Leu Val lie Ala Ala Leu Thr Ala Ala Gly 
365 370 375 

TGG CTA ATC GCC GGG GGA CAA CCC GAC CGT GCC GTC TCG GCC GCA CTC 
Trp Leu lie Ala Gly Gly Gin Pro Asp Arg Ala Val Ser Ala Ala Leu 
380 385 390 

GCC GTG CTT GTC ATC GCC TGC CCG TGT GCC CTG GGG CTG GCG ACT CCG 
Ala Val Leu Val He Ala Cys Pro Cys Ala Leu Gly Leu Ala Thr Pro 
395 400 405 

ACC GCG ATG ATG GTG GCC TCT GGT CGC GGT GCC CAG CTC GGA ATA TTT 
Thr Ala Met Met Val Ala Ser Gly Arg Gly Ala Gin Leu Gly He Phe 
410 415 420 

CTG AAG GGC TAC AAA TCG TTG GAG GCC ACC CGC GCG GTG GAC ACC GTC 
Leu Lys Gly Tyr Lys Ser Leu Glu Ala Thr Arg Ala Val Asp Thr Val 
425 430 435 440 

GTC TTC GAC AAG ACC GGC ACC CTG ACG ACG GGC CGG CTG CAG GTC AGT 
Val Phe Asp Lys Thr Gly Thr Leu Thr Thr Gly Arg Leu Gin Val Ser 
445 450 455 

GCG GTG ACC GCG GCA CCG GGC TGG GAG GCC GAC CAG GTG CTC GCC TTG 
Ala Val Thr Ala Ala Pro Gly Trp Glu Ala Asp Gin Val Leu Ala Leu 
460 465 470 

GCC GCG ACC GTG GAA GCC GCG TCC GAG CAC TCG GTG GCG CTC GCG ATC 
Ala Ala Thr Val Glu Ala Ala Ser Glu His Ser Val Ala Leu Ala He 
475 480 485 

GCC GCG GCA ACG ACT CGG CGA GAC GCG GTC ACC GAC TTT CGC GCC ATA 
Ala Ala Ala Thr Thr Arg Arg Asp Ala Val Thr Asp Phe Arg Ala He 
490 495 500 

CCC GGC CGC GGC GTC AGC GGC ACC GTG TCC GGG CGG GCG GTA CGG , GTG 
Pro Gly Arg Gly Val Ser Gly Thr Val. Ser Gly Arg Ala Val Arg Val 
505 510 515 520 

GGC AAA CCG TCA TGG ATC GGG TCC TCG TCG TGC CAC CCC AAC ATG CGC 
Gly Lys Pro Ser Trp He Gly Ser Ser Ser Cys His Pro Asn Met Arg 
525 530 535 

GCG GCC CGG CGC CAC GCC GAA TCG CTG GGT GAG ACG GCC GTA TTC GTC 
Ala Ala Arg Arg His Ala Glu Ser Leu Gly Glu Thr Ala Val Phe Val 
540 545 550 

GAG GTC GAC GGC GAA CCA TGC GGG GTC ATC GCG GTC GCC GAC GCC GTC 
Glu Val Asp Gly Glu Pro Cys Gly Val He Ala Val Ala Asp Ala Val 
555 560 565 

AAG GAC TCG GCG CGA GAC GCC GTG GCC GCC CTG GCC GAT CGT GGT CTG 
Lys Asp Ser Ala Arg Asp Ala Val Ala Ala Leu Ala Asp Arg Gly Leu 
570 575 580 



r 



CGC ACC ATG CTG TTG ACC GGT GAC AAT CCC GAA TCG GCG GCG GCC GTG 
Arg Thr Met Leu Leu Thr Gly Asp Asn Pro Glu Ser Ala Ala Ala Val 
585 590 595 600 

GCT ACT CGC GTC GGC ATC GAC GAG GTG ATC GCC GAC ATC CTG CCG GAA 
Ala Thr Arg Val Gly He Asp Glu Val He Ala Asp He Leu Pro Glu 
605 610 615 

GGC AAG GTC GAT GTC ATC GAG CAG CTA CGC GAC CGC GGA CAT GTC GTC 
Gly Lys Val Asp Val He Glu Gin Leu Arg Asp Arg Gly His Val Val 
620 625 630 

GCC ATG GTC GGT GAC GGC ATC AAC GAC GGA CCC GCA CTG GCC CGT GCC 
Ala Met Val Gly Asp Gly He Asn Asp Gly Pro Ala Leu Ala Arg Ala 
635 640 645 

GAT CTA GGC ATG GCC ATC GGG CGC GGC ACG GAC GTC GCG ATC GGT GCC 
Asp Leu Gly Met Ala He Gly Arg Gly Thr Asp Val Ala He Gly Ala 
650 655 660 

GCC GAC ATC ATC TTG GTC CGC GAC CAC CTC GAC GTT GTA CCC CTT GCG 
Ala Asp He He Leu Val Arg Asp His Leu Asp Val Val Pro Leu Ala 
665 670 675 680 

CTT GAC CTG GCA AGG GCC ACG ATG CGC ACC GTC AAA CTC AAC ATG GTC 
Leu Asp Leu Ala Arg Ala Thr Met Arg Thr Val Lys Leu Asn Met Val 
685 690 695 

TGG GCA TTC GGA TAC AAC ATC GCC GCG ATT CCC GTC GCC GCT GCC GGA 
Trp Ala Phe Gly Tyr Asn He Ala Ala He Pro Val Ala Ala Ala Gly 
700 705 710 

CTG CTC AAC CCC CTG GTG GCC GGT GCG GCC ATG GCG TTC TCA TCG TTC 
Leu Leu Asn Pro Leu Val Ala Gly Ala Ala Met Ala Phe Ser Ser Phe 
715 720 725 

TTC GTG GTC TCA AAC AGC TTG CGG TTG CGC AAA TTT GGG CGA TAC CCG 
Phe Val Val Ser Asn Ser Leu Arg Leu Arg Lys Phe Gly Arg Tyr Pro 
730 735 740 

CTA GGC TGC GGA ACC GTC GGT GGG CCA CAA ATG ACC GCG CCG TCG TCC 
Leu Gly Cys Gly Thr Val Gly Gly Pro Gin Met Thr Ala Pro Ser Ser 
745 750 755 760 

GCG TGATGCGTTG TCGGGCAACA CGATATCGGG CTCAGCGGCG ACCGCATCCG 
Ala 



GTCTCGGCCG AGGACCAGAG GCGCTTCGCC ACACCATGAT TGCCAGGACC GCGCCGATCA 2900 

CCACCGGCAG ATGAGTCAAA ATCCGCGTGG TGCTGACCGC GCCGGACAGC GCATCCACAA 2960 

TCACATAGCC GGTCAGTATG GCGACGAACG CCGTCAGAAC ACCGGCCAGG CCGGCGGCGG 3020 

CGCTCGGCCA TAGCGCCGCG CCCACCATGA TCACACCGAG CGCAATCGAC CACGACGTGA 3080 
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CTCGTTGAGC AAGTGGGTGC CGGCACCCGT CGGGTGCTGA TGGGTCAGGC CGACGTCTAG 3140 

GCCAAACCCC TGCACGGTGC CCAGGGCGAT CTGCGCGATG CCCACGCACA GCAACGCCCA 3200 

ACGTCGCCAG GTCATCGGTG AATGTTGCCG CCGCGGCGCC CGGCGGATCC 3250 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Thr Ala Ala Val Thr Gly Glu His His Ala Ser Val Gin Arg He 
15 10 15 

Gin Leu Arg He Ser Gly Met Ser Cys Ser Ala Cys Ala His Arg Val 
20 25 30 

Glu Ser Thr Leu Asn Lys Leu Pro Gly Val Arg Ala Ala Val Asn Phe 
35 AO 45 

Gly Thr Arg Val Ala Thr He Asp Thr Ser Glu Ala Val Asp Ala Ala 



Ala Leu Cys Gin Ala Val Arg Arg Ala Gly Tyr Gin Ala Asp Leu Cys 
65 70 75 80 

Thr Asp Asp Gly Arg Ser Ala Ser Asp Pro Asp Ala Asp His Ala Arg 
85 90 95 

Gin Leu Leu He Arg Leu Ala He Ala Ala Val Leu Phe Val Pro Val 
100 105 110 

Ala Asp Leu Ser Val Met Phe Gly Val Val Pro Ala Thr Arg Phe Thr 
115 120 125 

Gly Trp Gin Trp Val Leu Ser Ala Leu Ala Leu Pro Val Val Thr Trp 
130 135 140 

Ala Ala Trp Pro Phe His Arg Val Ala Met Arg Asn Ala Arg His His 
145 150 155 160 

Ala Ala Ser Met Glu Thr Leu He Ser Val Gly He Thr Ala Ala Thr 
165 170 175 

He Trp Ser Leu Tyr Thr Val Phe Gly Asn His Ser Pro He Glu Arg 
180 185 190 



Ser Gly He Trp Gin Ala Leu Leu Gly Ser Asp Ala He Tyr Phe Glu 
195 200 205 



-39- 



Val Ala Ala Gly Val Thr Val Phe Val Leu Val Gly Arg Tyr Phe Glu 
210 215 220 

Ala Arg Ala Lys Ser Gin Ala Gly Ser Ala Leu Arg Ala Leu Ala Ala 
225 230 235 240 

Leu Ser Ala Lys Glu Val Ala Val Leu Leu Pro Asp Gly Ser Glu Met 
245 250 255 

Val lie Pro Ala Asp Glu Leu Lys Glu Gin Gin Arg Phe Val Val Arg 
260 265 270 

Pro Gly Gin He Val Ala Ala Asp Gly Leu Ala Val Asp Gly Ser Ala 
275 280 285 

Ala Val Asp Met Ser Ala Met Thr Gly Glu Ala Lys Pro Thr Arg Val 
290 295 300 

Arg Pro Gly Gly Gin Val He Gly Gly Thr Thr Val Leu Asp Gly Arg 
305 310 315 320 

Leu He Val Glu Ala Ala Ala Val Gly Ala Asp Thr Gin Phe Ala Gly 
325 330 335 

Met Val Arg Leu Val Glu Gin Ala Gin Ala Gin Lys Ala Asp Ala Gin 
340 345 350 

Arg Leu Ala Asp Arg He Ser Ser Val Phe Val Pro Ala Val Leu Val 
355 360 365 

He Ala Ala Leu Thr Ala Ala Gly Trp Leu He Ala Gly Gly Gin Pro 
370 375 380 

Asp Arg Ala Val Ser Ala Ala Leu Ala Val Leu Val He Ala Cys Pro 
385 390 395 400 

Cys Ala Leu Gly Leu Ala Thr Pro Thr Ala Met Met Val Ala Ser Gly 
405 410 415 

Arg Gly Ala Gin Leu Gly He Phe Leu Lys Gly Tyr Lys Ser Leu Glu 
420 425 430 

Ala Thr Arg Ala Val Asp Thr Val Val Phe Asp Lys Thr Gly Thr Leu 
435 440 445 

Thr Thr Gly Arg Leu Gin Val Ser Ala Val Thr Ala Ala Pro Gly Trp 
450 455 460 

Glu Ala Asp Gin Val Leu Ala Leu Ala Ala Thr Val Glu Ala Ala Ser 
465 470 475 480 

Glu His Ser Val Ala Leu Ala He Ala Ala Ala Thr Thr Arg Arg Asp 
485 490 495 



Ala Val Thr Asp Phe Arg Ala He Pro Gly Arg Gly Val Ser Gly Thr 
500 505 510 
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Val Ser Gly Arg Ala Val Arg Val Gly Lys Pro Ser Trp lie Gly Ser 
515 520 525 



Ser Ser Cys His Pro Asn Met Arg Ala Ala Arg Arg His Ala Glu Ser 
530 535 540 



Leu Gly Glu Thr Ala Val Phe Val Glu Val Asp Gly Glu Pro Cys Gly 
545 550 555 560 



Val He Ala Val Ala Asp Ala Val Lys Asp Ser Ala Arg Asp Ala Val 
565 570 575 



Ala Ala Leu Ala Asp Arg Gly Leu Arg Thr Met Leu Leu Thr Gly Asp 
580 585 590 



Asn Pro Glu Ser Ala Ala Ala Val Ala Thr Arg Val Gly He Asp Glu 
595 600 605 



Val He Ala Asp He Leu Pro Glu Gly Lys Val Asp Val He Glu Gin 
610 615 620 



Leu Arg Asp Arg Gly His Val Val Ala Met Val Gly Asp Gly lie Asn 
625 630 635 640 



Asp Gly Pro Ala Leu Ala Arg Ala Asp Leu Gly Met Ala lie Gly Arg 
645 650 655 



Gly Thr Asp Val Ala He Gly Ala Ala Asp He He Leu Val Arg Asp 
660 665 670 



His Leu Asp Val Val Pro Leu Ala Leu Asp Leu Ala Arg Ala Thr Met 
675 680 685 



Arg Thr Val Lys Leu Asn Met Val Trp Ala Phe Gly Tyr Asn He Ala 
690 695 700 



Ala He Pro Val Ala Ala Ala Gly Leu Leu Asn Pro Leu Val Ala Gly 
705 710 715 - 720 



Ala Ala Met Ala Phe Ser Ser Phe Phe Val Val Ser Asn Ser Leu Arg 
725 730 735 



Leu Arg Lys Phe Gly Arg Tyr Pro Leu Gly Cys Gly Thr Val Gly Gly 
740 745 750 



Pro Gin Met Thr Ala Pro Ser Ser Ala 
755 760 



WHAT IS CLAIMED IS : 
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1. Composition comprising recombinant nucleic acid 
encoding all or part of a membrane-associated 
polypeptide of a mycobacterium, wherein said 

5 mycobacterium is capable of inducing an immune response 
that is detectable with all or part of said membrane- 
associated polypeptide. 

2. The composition of Claim 1 wherein said 
mycobacterium is selected from the group consisting of 

10 M. bovis, M. tuberculosis , M. leprae , M. af ricanum , and 
M. microti, M. avium , M. intracellular and M. 
scrofulaceum . 

3. The composition of Claim 1 wherein said 
mycobacterium is M. bovis BCG. 

15 4. The composition of Claim 3 wherein said membrane- 
associated polypeptide comprises an ion-motive ATPase. 

5. The composition of Claim 4 wherein said ATPase has 
a deduced molecular weight of about 79kD. 

6. The composition of Claim 1 wherein said membrane- 
20 associated polypeptide is encoded by a DNA sequence 

capable of hybridizing with nucleic acid containing all 
or part of the DNA SEQUENCE ID NO: 1. 

7. The composition of Claim 6 wherein said nucleic 
acid encodes at least an extracellular domain of said 

25 membrane-associated polypeptide. 



8. The composition of Claim 6 wherein said nucleic 
acid encodes at least an intracellular domain of said 
membrane-associated polypeptide. 
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9. The composition of Claim 6 wherein said nucleic 
acid encodes at least one transmembrane domain of said 
membrane-associated polypeptide. 

10. The composition of Claim 9 wherein said nucleic 
5 acid encodes a chimeric polypeptide comprising said at 

least one transmembrane domain and an immunogenic 
polypeptide. 

11. Composition comprising all or part of a membrane- 
associated polypeptide of a mycobacterium, wherein said 

10 mycobacterium is capable of inducing an immune response 
that is detectable with all or part of said membrane- 
associated polypeptide. 

12. The composition of Claim 11 wherein said 
mycobacterium is selected from the group consisting of 

15 M. bovis , M. tuberculosis , M. leprae , M. af ricanum , and 
M. microti , M. arium , M. intracellular and M. 
scrofulaceum . 

13. The composition of Claim 11 wherein said 
mycobacterium is M. bovis BCG. 

20 14. The composition of Claim 13 wherein said membrane- 
associated polypeptide comprises an ion-motive ATPase. 

15. The composition of Claim 14 wherein said ATPase 
has a deduced molecular weight of about 79kD. 

16. The composition of Claim 11 wherein said membrane- 
25 associated polypeptide is encoded by a nucleic acid 

capable of hybridizing with a nucleic acid encoding all 
or part of DNA SEQUENCE ID N0:1. 
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17. The composition of Claim 16 wherein said 
polypeptide comprises at least an extracellular domain 
of said membrane-associated polypeptide. 

18. The composition of Claim 16 wherein said 
5 polypeptide comprises at least an intracellular domain 

of said membrane-associated polypeptide. 

19. The composition of Claim 16 wherein said 
polypeptide comprises at least one transmembrane domain 
of said membrane-associated polypeptide. 

10 20. The composition of Claim 19 wherein said 
polypeptide comprises a chimeric polypeptide comprising 
said at least one transmembrane domain and an 
immunogenic polypeptide. 

21. A vaccine comprising all or part of a membrane- 
15 associated polypeptide of a mycobacterium or 
expressible nucleic acid encoding all or part of said 
polypeptide, in a recombinant vaccine vehicle capable 
of expressing said DNA, wherein the vaccine vehicle 
comprises a virus or a bacterium. 

20 22. The vaccine of Claim 21 wherein said membrane- 
associated polypeptide is an ion-motive ATPase of a 
mycobacterium. 

23. Nucleic acid comprising a promoter sequence from 
an ion-motive ATPase of a mycobacterium. 
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ABSTRACT OF THE DISCLOSURE 

Nucleic acid encoding four novel immunodeterminant 
protein antigens of M. bovis BCG, which is a vaccine 
strain for tuberculosis, have been isolated. These 
5 genes were isolated as immunoreactive recombinant 
clones from a genomic library of M. bovis BCG DNA», 
constructed in pBR322 vector, and screened with sera 
collected from tuberculosis patients. The BCG DNA 
insert of one of the recombinants, -pMBB51A, which 

10 expressed an antigen of Mr 90 kD, was sequenced 
completely and an ORF encoding 7 61 amino acids encoding 
a protein of deduced molecular weight 79 kD, was 
identified. This gene was identified to encode a 
membrane bound, ion-motive ATPase of M. bovis BCG. The 

15 approach described here can be used to identify 
immunogens of mycobacteria. In addition, the well- 
characterized M. bovis BCG antigens can be used in the 
prevention, diagnosis and treatment of tuberculosis. 
The 79 kD antigen is also useful in the design of 

20 recombinant vaccines against different pathogens. The 
sequence of the 79 kD membrane-associated polypeptides 
also are useful for the development of specific PCR 
amplification based diagnostic procedures for the 
detection of mycobacteria. Also, the promoter of the 

25 79 kD antigen is useful for expressing homologous 
and/or heterologous antigens in mycobacteria. 
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